Hanlin Tang

RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

May 26, 2026
Via arXiv

Rethinking Cross-Layer Information Routing in Diffusion Transformers

May 20, 2026
Via arXiv

Shiva-DiT: Residual-Based Differentiable Top-$k$ Selection for Efficient Diffusion Transformers

Feb 05, 2026
Via arXiv

RazorAttention: Efficient KV Cache Compression Through Retrieval Heads

Jul 22, 2024
Via arXiv

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

Mar 05, 2024
Via arXiv

MKQ-BERT: Quantized BERT with 4-bits Weights and Activations

Mar 25, 2022
Via arXiv

PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic

Aug 20, 2021
Via arXiv

On the geometry of generalization and memorization in deep neural networks

May 30, 2021
Via arXiv

Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models

Apr 15, 2021
Via arXiv

1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed

Apr 13, 2021
Via arXiv